ANOMALIES IN THE ANALYSIS 
OF CALIBRATED DATA 

D. R. JENSEN AND D. E. RAMIREZ 

Abstract. This study examines effects of calibration errors on model assumptions and 
data— analytic tools in direct calibration assays. These effects encompass induced depen- 
dencies, inflated variances, and heteroscedasticity among the calibrated measurements, 
whose distributions arise as mixtures. These anomalies adversely affect conventional in- 
ferences, to include the inconsistency of sample means; the underestimation of measure- 
ment variance; and the distributions of sample means, sample variances, and Student's t 
as mixtures. Inferences in comparative experiments remain largely intact, although error 
mean squares continue to underestimate the measurement variances. These anomalies 
are masked in practice, as conventional diagnostics cannot discern irregularities induced 
through calibration. Case studies illustrate the principal issues. 



1. Introduction 

Calibrated measurements, intrinsic to the sciences and engineering, are inherently subject 
to errors of calibration. These errors induce dependencies in violation of a basic tenet in 
much of applied statistics, namely, that observations should be uncorrelated if not indepen- 
dent. These issues traditionally have been overlooked by both scientists and statisticians, 
despite a century of emerging methodologies for the analysis of experimental data. Not 
only are many parametric and nonparametric procedures at risk under such violations, but 
so also are conventional diagnostics for checking critical features of a model. We return to 
these subsequently. 

To fix ideas, observed responses {Zq, Z\, . . . , Z n } are often adjusted to Zq as stan- 
dard, giving differences {Yi = (Zi — Zq);1 < i < n} as the objects of interest to the 
investigator. Moreover, if {Zq, Z%, . . . , Z n } are mutually uncorrelated having variances 
{Var(Z ) = (Tqj Var(Zi) = a 2 ; 1 < i < n}, then {Yx, . . . ,Y n } are equicorrelated with 
parameter p = (Jq/{(t 2 + a 2 ), having variances {VariYi) = a 2 + (7q; 1 < i < n} that are 
inflated in comparison with unadjusted values. 

Linearly calibrated instruments are pervasive. Some unintended consequences, to be 
examined here, include (i) the structure of induced dependencies, heteroscedasticity, and 
other departures from conventional model assumptions; (ii) the inflation of measurement 
variances in comparison with intended values; and (iii) effects of calibration on conventional 
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inferences for location, scale, and model diagnostics. We first examine moments, then effects 
of calibration on the actual measurement distributions themselves. We focus here on direct 
calibration assays to be identified subsequently. An outline follows. 

Section 2 gives notation and technical support. Section 3 reconsiders the calibration pro- 
cess with reference to irregularities induced through calibration errors. Section 4 addresses 
the impact of these irregularities on conventional inferences, to include (i) inferences regard- 
ing the mean and variance in a single sample; and (ii) the analysis of one-way experimental 
data, including tests on means and variances. The latter remain largely intact, although 
measurement variances continue to be underestimated. Section 5 reexamines the ability of 
conventional diagnostics to uncover violations induced through calibration. Section 6 un- 
dertakes a case study to illustrate essential findings. Section 7 offers a brief summary and 
cautionary note. Some peripheral matters are deferred to an Appendix. 

2. Preliminaries 

2.1. Notation. Designate R n as Euclidean ra-space, R™ as its positive orthant, §„ as the 
real symmetric (n x n) matrices, and §+ and S° as their positive definite and positive 
semidefinite varieties. Arrays appear in bold type, to include the transpose A' and inverse 
A^ 1 of A; the unit vector 1„ = [1, . . . , 1]' e R"; the identity matrix I n ; a block-diagonal 
matrix Diag{A\ 1 . . . , Ak); and B n = (I n — n _1 l„l4). Following Loewner (1934), matrices 
(A, B) in §„ are said to be ordered as A >l B for A — B e S°, with A B whenever 
A — B e §+. Moreover, C(n) comprises the convex sets in R n symmetric under reflection 
through <G R™. Operators E(Y) and V(Y) designate the expected vector and dispersion 
matrix for Y € R™, with Var(Y) as the variance on R 1 . We further require {fj, r (Z) = 
E(Z r );r = 1,2} as moments about G R 1 , identifying k 2 = ^2(^1) m t erms of a linear 
estimatot [3 l to be encountered subsequently. The comparative concentration of probability 
measures on R" may be gauged on defining the measure ^(-) to be more peaked about 
<G R" than v(-), if and only if ji{A) > v(A) for every set A e C(n), as in Sherman 
(1955). Specifically, the peakedness ordering for scale mixtures of Gaussian measures on R™ 
is tantamount to the stochastic ordering of their mixing distributions, as demonstrated in 
the Appendix. 

2.2. Special Distributions. Here pdf and cdf refer to probability density and cumulative 
distribution functions; for Y e R", CiY) designates its law of distribution and G(y) its cdf; 
and iid refers to independent and identically distributed random elements. Distributions of 
note on R 1 include the Gaussian law Ni(/i, a 2 ), with parameters (^, cr 2 ); noncentral versions 
of Student's t{y, A) and t 2 (^,A), chi-squared x 2 (^,A), and Snedecor-Fisher i^(^i,^ 2 ,A) 
distributions, with {^,^1,^2} as degrees of freedom and A as a noncentrality parameter; 
and Go(a,/3) as the gamma distribution on R^_ having parameters (a, (3). In particular, 
gr(t] v 1 A), gr 2 { u : v, A), gF{u; ^1, ^2, A), and go{u; a, (3) designate the densities corresponding 
to t(v, A), t 2 {v, A), F(^i,j/ 2 ,A), and Go(a,/3), respectively. 
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To continue, N n (0, X) designates the Gaussian law on W 1 , and g n (y; 9, X) its pdf, having 
location-scale parameters (9,11). Ensembles on R", and mixtures over these, include the 
translation-scale mixtures 

/oo 
ffn (y;0(t),E(t))dGi(i), (2.1) 
-oo 

and purely scale mixtures when = 0. Nonstandard distributions for quadratic forms 
proceed conditionally on letting C(U \ w) have the scaled gamma density g (u;a,w{3) = 
{w[})~ a u a ~ 1 e~ u / wl3 /Y(ot), then compounding these as 

..a— 1 /■oo 

f 3 (u;a,(3,G 3 ) = —— vj- a e-*/^dG 3 (w) (2.2) 
P 1 ("J Jo 

with G3(w) as a cdf on R^_. 

Subsequent developments have links to exchangeable sequences. Consider {Zq, Z\, Z%, . . .} 
such that Z e K fe is independent of {Z\, Z2, . ■ .}, whereas {Zi, Z 2 , ■ ■ ■} are iid on R°°. Fur- 
ther let V : R fe+1 ^M 1 ; define {Y; = V(^o, Z*); i = 1, 2, . . .}; recall from DcFinctti's theorem 
that the sequence {Y\, Y 2 , . . .} is now exchangeable on M°°; and infer that joint distributions 
projected onto M™ are invariant under permutations. In short, {Yi, . . . , Y n } are iid, under 
second moments having common values for the parameters (/i, <J 2 ,p). 

3. Calibration 

3.1. Essentials. Instruments are calibrated using outputs at successive inputs to determine 
a calibration curve; new readings are assigned values on the scale of measurements using 
the calibrated device; and periodic checks against a standard determine when recalibration 
is required. In this study we utilize direct assays in which instrumental readings {Xi; 1 < 
i < n } during calibration relate to observed measurements {L^; 1 < i < n } through {?7, = 
/3 +/3 1 X i + e i ; 1 < i < n }. For example, the octane rating (£7) in the production of gasoline 
relates linearly to the percent of purity (X) in a specimen to be assayed. Octane numbers 
require expensive and time-consuming dynamic laboratory testing, whereas the percent 
purity is readily determined. Once calibrated, the octane number of a given specimen is 
determined vicariously from its percent purity. On the other hand, indirect assays proceed 
on reversing the roles of Ui and Xi during calibration. Models for calibration and their 
analyses have been debated by several authors; for a summary and early references see 
Krutchkoff (1971). Problems with moments and consistency remain to be resolved in indirect 
assays, but the technical issues between the two types differ mainly in detail. It is noteworthy 
that research has yet to address the principal issues undertaken here, namely, irregularities 
in models and supporting analyses attributable to calibration. 

3.2. Error Analysis. To continue, consider the calibrating model {Ui = /3 + fiiXi + 
1 < i < "0} under Gauss-Markov assumptions, such that {Var(Ui) = a\j\ 1 < i < no} 

and (/3p, p\) are least-squares estimators determining the empirical calibration line. Under 
Gaussian calibration, the calibration errors {ef, 1 < i < n } comprise iid Ni(0, af,) random 
variables. Subsequent readings {Z\, . . . , Z n }, taken independently of {U\, U no }, are 
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then projected as the calibrated measurements {Yi = [3 Q + /3 1 Zf, 1 < i < n}. In practice 
{Z\, . . . , Z n } often will have been discarded as redundant, or will have been converted 
directly without record, so that {Yi, . . . , Y n } remain to be analyzed and interpreted. If we 
now suppose that Z' — [Z\, . . . , Z n ] have means /x' = \fj, 1 , . . . , /j, n ] and second moments V(Z) 
= £ = [(Jij], independently of (/3 , P x ), then conditional moments of £(Y\, . . . , Y n | /3 X ) are 
found directly as follows. 

Lemma 1. Suppose that \Z\, . . . , Z n } have means {fJ,i, ■ ■ ■ , n n } and second moments V(Z) 
= [o~ij], independently of (/3 , /3x), and let {Yi — f3 Q + /3 1 Zi; 1 < i < n}. Then 

(i) E(Yi\0J = Etf^fc) 

(ii) Var(Y t | = plla H + Var0 o | &); and 
(in) Cov{Y t , Yj | ft) = plo-ij + Var0 Q \di)- 

If instead (/3 , ft) were known, then {Yi — (3 Q +(3 1 Z i ; 1 < i < n} would be recovered without 
error, in which case E{Y t ) = ft, +ft/^ and Var{Yi) = (3\Var{Zi). This ideal case serves as 
reference against which recovery subject to calibration errors may be gauged. 

Expressions simplify when neither E(f3 \ ft) nor Var([3 | ft) depends on ft, so that 
Cov(Yi,Yj | ft) = fliCTij + Var((3 ), for example. This clearly holds under Gaussian calibra- 
tion, provided that the calibrating readings {Xi, . . . ,X no } have been centered to {{X\ — 
X), . . . , (X no — X)}. This incurs no loss in generality, as subsequent readings {Zi, . . . , Z n } 
may be shifted by X units before projecting onto the scale of measurements. It then fol- 
lows that ft = U; Var(/3 ) = a\ = afj/no; and yar(ft) = a\ = a\jjS xx , where S xx — 
Sr=i(^ — X) 2 : so that (ft, ft) are uncorrelated and thus independent under Gaussian 
calibration errors. We henceforth take the initial calibration to have been centered. 

We next consider conditional and unconditional properties of {Yi = ft + ft-Zj; 1 < i < 
n} for the general case that E(Z) = fi z E R™ and V(Z) = X £ §+, to be specialized 
subsequently. Clearly the conditional means and dispersion parameters are E(Y | ft) = 
ftl„ + P 1 fi z = H Y 0i), sa Y: and V ( Y I Pi) = 3i s + o'o 1 " 1 ™ = 3 (3i)- Moreover, for the 
case that C(Z) = N n (/j, z , S) in addition to Gaussian calibration errors, then the conditional 
distribution of Y is C(Y\P 1 ) = 7V„(/Xy (ft), E(ft)). Basic unconditional properties follow 
next. 

Theorem 1. Consider calibrated measurements {Yi — ft + f3 1 Z i ; 1 < i < n} projected from 
readings {Z\, . . . , Z n } obtained independently of (/3 ^ ) 5 ^ Y = fio^-n + ft-Z", such that 
E(Z) = ii z el" and V(Z) = S e §+; and let a% = Var(/3 ) and u\ = Var(ft). Then 
unconditional moments E(Y) = fi Y and V(Y) = H of C(Y) are given by 
(%) Hy = ftln + PiVz, and 

(ii) S = k 2 S + o-\\ n \' n + o-\n z n' z , with n 2 = H 2 0i) = o~\ + 

(Hi) Moreover, if C(Z) = N n (p, z ,Ti) in addition to Gaussian calibration errors, then the 
unconditional joint density of the elements of Y is the translation-scale mixture 

/oo 
g n (y,H(t),E(t))dGi(t) (3.1) 
-OO 
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as in (2.1), with n(t) = (3 Q l n + t\x z , E(i) = t 2 E + (T§1„1^, and with mixing distribution 
Gi(t) = JVi(/?i,<7?). 

Proof. Conclusion (i) follows directly through deconditioning. Conclusion (ii) follows using 
Var(Y t ) = Eg [VariYilfiJ] + Var^ (E(Yi for variances, and 

Cov(Y i ,Y j ) = Eg i [Cov(Y u Y j \d 1 )]+Cov A [E(Y i 1 ), £7(1^ 1 3i)] 

for covariances. Conclusion (iii) follows since Y is a linear function of (Z, (3 ) with /3 1 fixed, 
so that £(V — N n (n Y (0i)i ^(/^i))> as noted, and then mixing over the distribution of 
the conditioning variable. □ 

The foregoing results are basic. We next specialize them as appropriate for specific 
experimental settings encountered routinely in practice. 

4. Topics in Inference 

Induced dependencies and other model irregularities violate the tenets of conventional 
data analysis as noted, specifically, in estimation and hypothesis testing. We focus on 
normal-theory procedures, as the independence typically required by nonparametric com- 
petitors is conspicuously absent. The following sections specialize earlier findings, as they 
apply in a single sample, and in one-way comparative experiments. 

4.1. Single Sample. Consider {Yi — j3 Q + (3 1 Z i ; 1 < i < n} such that elements of Z = 
[Z u . . . , Z n ]' satisfy {E(Z t ) = n z ; 1 < i < n} and V(Z) = a 2 z I n . If in addition {Z 1 , Z n } 
are iid, then C(Yi, . . . , Y n ) is exchangeable, as noted earlier. We are concerned not only with 
properties of the joint distribution C(Y), but also of (Y,S Y ,t ) as the sample mean, the 
sample variance, and Student's statistic t = n 1 / 2 (Y" — /j, y )/Sy, as well as the ordinary 
residuals {Ri = (Yi — Y); 1 < i < n}. From Lemma 1 and Theorem 1 we now have the 
conditional and unconditional values {E(Yi \(3 l ) = /3 + (3 1 fi z ; 1 < i < n}; {E{Y{) = [i Y = 
00 + PiHz\ and V(Y \ = H(^) = $\a 2 z + a 2 )[(l - p)I n + pl n \' n ], with p = 
a o/(Pi (T 'z + <J o)- The unconditional variances are homoscedastic, namely, {Var(Yi) = a Y = 
K 20~ z + &o + < J iL l ' Z ', 1 < i < n}. Essential findings follow, where it is seen that S Y may grossly 
underestimate the actual measurement variance a Y , and that structural difficulties becloud 
both the small-sample and the asymptotic properties of Y n = (Yi +Y 2 + . . . + Y n )/n. 

Theorem 2. Let {Yi = (3 +(3 1 Z i ;l < i < n} be calibrated measurements from {Z 1} . . . , Z n }, 
such that E(Z) = [i z l n and V(Z) = a z I n independently of (/3 , and consider the 
sample quantities (Y n , S Y ), together with the ordinary residuals {Ri = (Yi — Y); 1 < i < n}. 
Then 

(i) Y n is unbiased but inconsistent for estimating fi Y = f3 + PifJ. z ', 

(ii) E(S Y ) = K 2 a% = o\(o\ + and 
(in) {E(Ri) = 0; 1 < i < n}. 
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Proof. The unbiasedness of Y n follows routinely, and its variance from Var(n 1 l' n Y) = 
S„ with 

S n = n- 2 l' n [ K2 a 2 z I n + (a 2 + a\n 2 z )\ n l' n ]l n (4.1) 
= n- 1 K 2 a z + {al + a 2 iJ? z ). 

Since lim n ^ 00 Var(Y n ) = (erg + CiMz) > 0, it follows that its limit distribution is nondc- 
generate at [i Y , so that Y n is consistent neither in probability, nor in mean square, nor 
almost surely, as asserted. Conclusion (ii) follows from evaluating the expected value of the 
quadratic form (n - 1)S Y = Y B n Y as E[(n - l)S Y ] = trB n V(Y) + n' Y B n \x Y . Details 
are 

E[(n - l)S Y ] = tr B n [K 2 a 2 z I n + (a 2 + a 2 fi 2 z )l n l' n }B n + ii! Y B ni i Y 
= (n - 1)k 2 <j 2 z 

where fi' Y B n fi Y = (f3 + (3 1 /j, z ) 2 l' n B n l n = 0, since B n is idempotent and B n l n = 0. 
Conclusion (iii) follows directly, to complete our proof. □ 
The following consequences may be noted. 

• Conclusion (i) appears to dash the usual expectation that lengths of (1 — a) confi- 
dence intervals for [i Y will decrease at the rate 0(n -1 / 2 ). 

• The sample variance S Y underestimates the actual variance a Y . The bias is E{S Y ) 

- <4 = -K + cr iM|)- 

• This bias increases with decreasing precision in estimating the calibration line, and 
with increasing \n z \ an d t nus | \x Y \ . 

• On the other hand, the expectation E{S Y ) — k 2 <j%, with k 2 — fJ, 2 (0i), may be 
compared with the ideal variance, Var(Yi) — f3\a 2 z , attained under linear calibration 
with known parameters (/3 ,/3i)- 

We have seen how unconditional moments of calibrated measurements depend on those 
of the conditioning variable fi l . It remains to examine effects of the fitted calibration line 
on unconditional distributions, to include those of various sample statistics. Recall from 
Theorem 2 and its proof that E(Y) = \i Y = Pq+P^z and Var(Y) = n^ 1 n 2 a 2 z + (al+(r\^ 2 z ). 
To invoke expression (2.1) and its special case at = 0, under Gaussian assumptions we 
have Gi(3i) = <?i), together with G 2 @ 1 ;\), such that C(fija\) = X 2 (l,^) with 

A = (3 2 /al- Basic unconditional distributions follow next as mixtures . 

Theorem 3. Let {Yi = [3 + (3 l Z i ;l < i < n} be calibrated measurements; suppose that 
{Zi,...,Z n } are iid Ni(fi z ,a z ) independently o/(/3 ,/3 1 ) under Gaussian calibration; and 
consider the sample quantities (Y n , S Y , t^), together with the ordinary residuals {Ri = (Yi — 
Y); 1 < i < n}, where t\ — n(Y — [i Y ) 2 /S Y for testing H : \i Y = /j, Y against Hi : fi Y ^ /j, Y . 
Then unconditional properties are as follows. 

(i) The unconditional density of L(Y) is the translation- scale mixture 

/oo 
ffi(u;/i(t),5„(t))dGi(i) (4.2) 
-oo 
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with mixing distribution Gi(/3 1 ) = N\{(3 1 ,a\) as in (2.1) for the case n = 1, where /x(i) = 
flo+tf-z and E n (t) = (t 2 a z /n + cr 2 ,), and 5„ is defined in (4-1). 
(ii) The joint density of residuals R = [Ri, . . . , R n }' is given by 

/>oo 

f n {r;0,a%B n ,G 2 ) = g n {r;0,ta 2 z B n )dG 2 (t) (4.3) 
Jo 

^2 

wifft mixing distribution G 2 ((3 1 ; A) = x 2 (l, A) and A = 0\/a\. 

(Hi) The joint distribution C(R\, . . . , R n ) increases in peakedness about e R" with 
decreasing A. 

(iv) C{vSyl 'o-%) has the density fo(u; 2, G 2 ) as in (2.2), with v = n — 1, mixing 

^2 

distribution G 2 (/3 1 ;A) = x 2 (l, A) and A = /3 2 /cr 2 ; and E(S Y ) = a\{a\ +/3 2 ). 
(V,) The distribution C(vS Y /a 2 z ) increases stochastically with A = (3\jd\. 

(vi) The unconditional density of t^ = n(Y — ii Y ) 2 j ' S Y is given as the mixture 

/•OO 

g(u;u,S,G 2 ) = g T 2(u;v,S/t)dG 2 {t) 
Jo 

^2 

with mixing distribution G 2 (f3 1 ; A), where v = n — 1, (5 = — n Y ) 2 /a 2 z , and A = . 

(vii) The unconditional cdf of £(io) increases stochastically with increasing S — {p Y ~ 
^) 2 /cr 2 7 for fixed A = 0\ja\\ and for fixed S, it decreases stochastically with increasing A. 

Proof. The conditional distribution of note is C(Y \ fa) = Ni((j,(fa), E n (fa)), where n(fa) — 

^ ^ ^2 

fa + fa[i z and S„(/3 1 ) = {fao 2 z /n + cr 2 ,). Its unconditional density thus is /i(it; /^.y, S„, Gi) 
from (2.1), to give conclusion (i) with mixing distribution as asserted. To continue, observe 

^ ^2 ^2 

that # = B n Y and J B„H(/3 1 ) J B„ = B n (faa%I n + all n l' n )B n = faa%B n since B n is 

^ ^2 

idcmpotcnt and B n l n = 0. We infer conditionally that £(R\(3 1 ) — N n (0, faa 2 z B n ), since 
B n fi(fa) = (f3 + fa/j, z )B n l n = 0. The unconditional distribution is the scale mixture as in 

^2 

conclusion (ii), with G 2 (fa, A) = x 2 (l, A) as the mixing distribution over [0, oo). Conclusion 

^2 

(iii) follows from Lemma A2 of the Appendix, as the mixing distribution C{fa/a\) = x 2 (l, A) 
increases stochastically with A = fa 2 ja\. To proceed, observe that (n — Y)S\ = R'R, so 
that £(R'R/faa 2 z | fa) = x 2 (n ~ 1,0). It follows that £[(n - l)S Y /a 2 z | fa] is a central 

^2 ^2 

chi-squared variate scaled by fa. On identifying (n — l)S Y /a z with [/ and /3 X with w in 
developments leading to (2.2), we thus establish conclusion (iv) on specializing from gamma 
to chi-squared distributions. Conclusion (v) follows directly from (iii) since the set A t = 
{R e K" : R'R < t} is convex and symmetric in M™, whereas P(A t ) = P[(n-1)S Y < ta\\. 

To see conclusion (vi), recall the afBne-invariance of t 2 = n(Y — n Y ) 2 / S Y for testing 
H : H Y = f-Y under {Yj — > a + bZf, 1 < % < n}, namely, t 2 = n[(Y - fj, Y ) + (fj, Y - )f I S Y 
= n{[(a + bZ) -(a + b» z )\ + b(^ Y - / b} 2 / b 2 S 2 = n[(Z - fji z ) + ( My - &)/b] 2 /S z . 
Clearly C(t 2 ) = t 2 (is,5), with v = n — 1 and S = (fi Y — /i y ) 2 / b 2 a 2 z , independently of a. 
We next apply these facts conditionally, given ((3 ,Pi), on replacing (a, b) with (/3 , 
to infer that C{t 2 \Ji 3i) = £(t 2 \ fa) = ^^,60,)), where 50,) - (jiy - lt x c\, 
independently of [3 . It follows that the unconditional distribution of t 2 has the mixture 
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density g(u\ v, 6, G 2 ) = gT 2 (u;v,5/u)dG 2 (u) with mixing distribution G 2 ((3 1 ;X) as as- 
serted, where 5 = (/j, y — n Y ) 2 /cr 2 z and A = (3 1 /cr 2 - To continue, the test for H a : [i Y = [i Y 
against H\ : [i Y 7^ H Y rejects at level a for t$ > (? a \ moreover, the conditional cdf £(tg \ (3 l ) 
increases stochastically with 5(/3 1 ), point -wise for each fixed /3 1 , from standard properties 
of noncentral ^-distributions. It follows that the unconditional cdf increases stochastically 
with increasing <5 under mixing. That the unconditional cdf decreases stochastically with 
increasing A, with S held fixed, follows unconditionally through mixing as in the proof for 
conclusion (v), to complete our proof. □ 

Note from conclusion (iv) that E{S Y ) — a 2 z (af+(3 2 ). This may be compared with the ideal 
case Var(Yi) = 0\<j 2 z where (/3 ,/3i) 

arc known. It is instructive to reexamine unconditional 
properties of this section if we retain the homogeneity of variances of {Z\, . . . , Z n }, but 
assume instead that means are not, i.e., that E(Z) — fi = \p, l , . . . , fi n ]'. For then we see 
that {VariYi) = n 2 a 2 z + o\ + crf/i 2 ; 1 < i < n}. We have the curious finding that calibrated 
measurements in a single sample will have heterogeneous variances under heterogeneous 
means, despite the homogeneity of variances of {Z\, . . . , Z n }. 

4.2. One— Way Experiments. Clearly {Y\, . . . , Y n } and {Z\, . . . , Z n } have the same ex- 
perimental structure, here a one-way experiment comprising k samples of sizes {ni, . . . , n k }, 
with n\ + . . . + rife = n. In keeping with conventional notation, partition Z' — [Z\, . . . , Z n ] 
as [Z[, . . . , Z' k ], such that {Z\ = [Zn, . . . , Z ini ];l < i < k}; and similarly for Y' = 
[Y[, ...,Y' k ], with {Y- = [Y a , Y ini }; 1 < i < k}; and suppose that {E{Z ij ) = fi { ; 1 < 
j < m} and {Var(Z i:j ) = w?; 1 < j < m}. Accordingly, take fj, z = [n x l' ni , . . . , fi k l' n J 
and V(Z) = Diag( ), say. Specializing from Section 3, 

we have the conditional moments E(Y\P 1 ) = = /3 Q l n + /3 1 {n 1 l' ni , . . . ,/j, k l' n ]' and 

V(Y | 3i) = H(3i) = f 1 Diag(u 2 1 I ni ,...,wll nit ) + a 2 l n l' n , together with C(Y \ \) = 
N n (pb Y {fi l ) ) 'E.(f3 l )) under Gaussian errors. Moreover, unconditional moments are E(Y) = 
fi Y = /3 l n + /3 1 [/i 1 lJ ll ,...,/i fc lJ l J' and V(Y) = S = K 2 Diag{u\I ni , . . . ,w 2 k I nk ) + a\\,X n 
+ a 2 M, where M = [Mij] = fJ-z^'z has the block structure M%j = /^//j l ni 1^ . . In partic- 
ular, for typical calibrated measurements in sample i of the k samples, the conditional and 
unconditional means are {E(Yij | = (3 +(3 1 fi i ; 1 < j < rii} and {EiYij) = /3 +(3 1 fi i ; 1 < 
j < ni} 7 whereas the corresponding variances are {VariYij \ = (3\W 2 + er§; 1 < j < nj} 
and {Var(Yij) = k 2 uj 2 + a\ + ; 1 < j < rii}. We are concerned with the dual issues 
of the homogeneity of means, and of the homogeneity of variances, across the k samples. 
Clearly the induced irregularities are artifacts of the calibration process, rather than conse- 
quences of the experimental structure itself. It remains to determine unintended effects of 
calibration on conventional comparisons among the means and the variances. 

In contrast to conventional one-way experiments, where homoscedasticity can be checked 
regardless of heterogeneity among the k population means, under calibration we see that ho- 
mogeneity of the unconditional variances is possible only in unusual circumstances. Specif- 
ically, homoscedasticity holds unconditionally if and only if, for every pair (u 2 ,fi 2 ) and 
(uj 2 , n 2 ), that (w? - lo 2 ) = c(fi 2 - n 2 ) with c = (t 2 /k 2 . 
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To continue, consider transformations T\(Y), T2(Y), and Ts(Y) such that T\{Y) = Y 
= [Yi, . . . ,Yk}' comprise the k sample means; T 2 (Y) = R' = [R[, . . . , R' k }' consists of 
the ordinary within-sample residuals, with Ri = B ni Yi and B ni — (J nj — l ni \' n .); 
and T 3 (Y) = [fiS 2 , . . . , VkSkY are the residual sums of squares, i.e., {i>iSf = R[Ri = 
Y^B^Yi; 1 < i < fc}, with v i =m — 1. Basic properties may be summarized as follows. 

Theorem 4. Consider calibrated measurements Y = [Y 1 , . . . , Y k ] corresponding to Z' = 
[Z[, . . . , Z' k ] such that E(Z) = [/^l^ , . . . , V k l' nk ]' and V(Z) = Diag(ujjl ni , . . . ,uj 2 k I nk ); 
and letT^Y) =Y = [¥,,..., Y k ]',T 2 (Y) =[R' 1 ,...,R' k }', and T 3 (Y) = [i/jS?, . . . , v k S 2 ]' , 
with {vi = rii — 1; 1 < i < k}. Moreover, a Gaussian model asserts that {(Zij — /j, i )/u) 2 ; 1 < 
j < n-i, 1 < i < k} are iid Ni(0, 1) random variates independently of((3 , [3{) under Gaussian 
calibration. 

(i) Conditional and unconditional moments ofT\(Y) = Y are given by E(Y \I3 1 ) = 0{Pi) 
= /3 lfe+3iM, E(Y) = 9 =/3 l fe +/J 1 /x, V(Y |3i) - Hi(3i) =txDiag{u\ln u . . . ,w£/n fc )+ 
CTolfclfe, and V(Y) = Hi = n 2 Diag(u)\/ni, . . . ,u? k jn k ) + crll k l' k + affifi' , where fi' = 

(ii) Under Gaussian models the unconditional density of CiY) is the translation- scale 
mixture 

/oo 
guiu-e^^^dGiit) (4.4) 
-oo 

with mixing distribution Gi(/3 1 ) = N 1 {(3 1 , a 2 ) as in (2.1), where 6{t) = /3 l k + tn and Hi(i) 
= t^Diag^l/m, . . . , Lu 2 k /n k ) + cr 2 l k l' k . 

(Hi) Conditional and unconditional moments of the residuals are E(R\/3 1 ) = E{R) = 0, 
V(R\j3i) =320!) =f 1 Diag{u 2 1 B ni ,...,u } 2 B nk ), and 

V(R) = H 2 = K 2 Diag(u) 2 B ni ,. . .,u? k B nk ). 

(iv) Under Gaussian models the joint density of residuals R = \R\, ■ ■ ■ , R' k ]' is given by 
f 2 (r; 0, E 2 , G 2 ) as in (4-3), with mixing distribution G 2 (/3 1 ; A) and A = P 2 /cr 2 - 

(v) Under Gaussian models the joint density of elements of [v\S 2 /lo\, . . . , v k Sl/Lo\]' is 
given by 

roc 

f(u;vi,...,v k ) = / T\go(u l ;v l /2,2w)dG 2 (w) 
J o i=l 

withvi = rii — 1 and g o (u;a,w0) = {w(3)~ a u a ~ 1 e~ u l w!i /T(a), having the mixing distribution 
G 2 0\;\) with A = i3\la\. 

Proof. Arguments follow step-by-step as in the proofs given in Section 4.1. The details 
differ, but proceed similarly on noting that Y = Diag(n^ 1 l' ni , . . . ^n^l^^Y = L{Y ', say, 
whereas R — Diag(B ni , . . . , B nk )Y = L' 2 Y . Conditional and unconditional moments follow 
directly as linear functions, together with the idempotencies of {B ni ; 1 < i < k} and the 
annihilations achieved through {B ni l ni = 0; 1 < i < k}. A Gaussian model for C(Z), 
and Gaussian errors during calibration, give conditional Gaussian laws for C(Y | /3 X ) and 
£(R\ P^, whereas unconditional distributions arc mixtures as in Section 2.2. Moreover, 



10 



D. R. JENSEN AND D. E. RAMIREZ 



{Si, . . . , Sj;} are conditionally independent given (3 1 . As in the proof for Theorem 3(iv), 
the marginal density of C{viSf /wf \P X ) is the scaled chi-squared density go(ui; Vi/2, 2w) as 
defined preceding (2.2), with w = f3 1 . Their unconditional density now follows on mixing as 
in Section 2.2, as asserted in conclusion (v), to complete our proof. □ 

It is essential to examine effects of calibration in comparing variances across the k groups, 
typically based on {Sf, . . . , S 2 }. In the ideal case where (f3 , (5{) are known, we would have 
{Var(Yij) = 0\ujf;l < i < k}, so that homoscedasticity across groups for measurements 
{Yij} would be tantamount to that for {Zij}. Under calibration errors, however, Theorem 
2(h) shows that Sf underestimates Var{Yij) = K2OJ 2 + a\ + o~f/J, 2 > the amount of bias, Bi = 
— (<7q + a\ujf), being an artifact of the calibration process itself. Accordingly, it is germane 
to examine homogeneity among the expected values {K2OJ 2 , ■ ■ ■ , ^to 2 ,} of {Sf, . . . , S 2 }. To 
these ends let T^S 2 , . . . , Sf) be any scale-invariant statistic based on the sample variances 
from the measurements {Yy-; 1 < j < rij, 1 < i < k}. The following result is basic. 

Theorem 5. Let {Sf , . . . , Sf} be within-sample variances from the calibrated measure- 
ments {Yij-,1 < j < rii, 1 < i < k} in a one-way experiment; let T±(S 2 , . . . , Sf) be any 
scale-invariant statistic; and consider a Gaussian model where {(Z^ — [i^)luj{; \ < j < 
n-i, 1 < i < k} are iid N\(0, 1) random variables independently of (f3 ,f3 1 ) under Gaussian 
calibration errors. Then the distribution ofT 4 {Sf, . . . , Sf) is identical to its normal-theory 
form, independently of ((Sq,/^) and the empirical calibration line. 

Proof. The proof for Theorem 4(v) asserts fiu; v\, . . . , v^) = n»=i 9o( u ii v i/2, 2) as the 
conditional density for C(v\S 2 / ' (3 1 lo\, . . . , v^S\f li-^J 2 , \fli), with Vi — ni — 1 and go(u; a, (3) = 
u a ~ 1 e~ u / /3 /p a T(a), so that {Sf, . . . , S%} are conditionally independent given (3 1 . But since 
T A (Sl, . . . , Sf ) is scale-invariant, C[T 4 (Sf, . . . ,Sf) |3i] = C[T 4 (Sf, S^] unconditionally, 
to complete our proof. □ 

It deserves note that meaningful comparisons among variances are necessarily scale- 
invariant. Moreover, it is seen that procedures based on {Sf , . . . , S%} support tests for 

^ ^2 

conditional hypotheses that {VariYij | /3 X ) = /3 1 w 2 + erf, 1 < i < k}, or equivalently, H n : 
uj\ = oj 2 , = ■ ■ ■ = u\, to be tested against alternatives as appropriate. Theorem 5 applies in 
the case of both null and non-null distributions of invariant test statistics. Tests in common 
usage include 

• Modifications of Bartlett's (1937) likelihood ratio test; 

• Cochran's (1941) test based on S 2 ma J{S\ + ... + Si); 

• Hartley's (1950) F-m&x test based on the maximal ratio max{S 2 / S 2 }; and 

• Gnanadesikan's (1959) simultaneous comparisons of treatment variances with a con- 
trol. 

To examine effects of calibration errors on the one-way analysis of variance for comparing 
means, we proceed conditionally given /3 1 , first assuming that {VariYij \ (3^ — P^ 2 + 

oh 1 < 3 < n u 1 < i < k}, so that V(Y | /?i) = 3i^ 2 ^n + o-§l„l^ = H(/?i) in the 
notation of Section 3.2. We are concerned with comparative inferences regarding elements 
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of (1(0!) = . . . , fi k ]' from E(Y) = ftl„ + ftb^l^, . . . , /J- k l' n J. Recall that I n = 

A + A 1 + A 2 partitions YI n Y = Y'A Q Y + Y ' A X Y + Y' A 2 Y such that Y'A Q Y = nY 2 , 
with Y as the grand mean and A Q = n- x \ n V n , Y' A{Y = Ylt=i n i {Y l -Y) 2 ; and Y'A 2 Y = 
J2i=i Sj=i(^ — Yi) 2 ■ To validate the Fisher-Cochran theorem conditionally requires that 
{AiS(f3 1 )Aj — 0; i ^ j}. Moreover, scale parameters associated with the quadratic forms 
are found as {ufAi = -4.jE(ft)Ai; i — 1,2,3}, whereas noncentrality parameters derive 
from expected mean squares. This program of study is carried out next in support of the 
following. 

Theorem 6. Let {Yij = ft + P 1 Z i j; 1 < j < rii, 1 < i < k} be calibrated measurements in a 
one-way experiment such that {(Zij — fi^/uj; 1 < j < rii, 1 < i < k} are iid iVi(0, 1) random 
variables independently o/(ft,ft) under Gaussian calibration errors. 

(i) The analysis of variance test for equality of elements of /x(ft) — ft!/*!, ■ ■ ■ , M fe ]', 
pertaining to the group measurement means, is identical in level and power to its normal- 
theory form. 

(ii) Supporting tests, based on linear contrasts among the group means, are identical in 
level and power to their normal-theory forms. 

Proof. To validate the Fisher-Cochran theorem conditionally, observe {A i H(/3 1 )A = 
0;i = 1,2}, since A;E(/?i)A = A 4 (ftu 2 J„ + o- 2 l n l' n )A and {A.Aq = 0;i = 1,2}. 

^ ^ ^2 

Similarly {A^p^Aj = 0; = 1, 2, i ^ j}, since AiE(ft)A; = A i (/3 1 uj 2 I n +nalA )A j 
and {AiAj = 0; (i, j) = 0, 1, 2, i ^ j} from standard properties of the one-way classification. 
Scale parameters, as determined from {njAi = Ai3(f3 \) Ai, i = 1,2}, are found to be 

^2 ^2 

equal, namely {n 2 Ai = A^P^ 2 I n + o-Ql n l' n )Ai = (3 1 uj 2 A i ;i — 1,2} from idempotency 
together with the annihilation {Ail n = 0;i = 1,2}, so that {k 2 = n 2 ;i = 1,2}. Finally, 
the noncentrality parameters and degrees of freedom associated with {Y'AiY; i = 1,2} are 
determined from their expected mean squares. These are {E(Y' AiY \ ft) = tr(AiS(f3 1 ) + 
[n0 1 )]'A i iJ,0 i y,i = 1,2}. It follows directly that 

k 

E{Y' A{Y | ft) = triA&fa) + [ M (ft)]'A lM Cft) = (k - 1)« 2 + ft £ n ^ ~ ^ 

i=l 

with fi = Ei=i n iVi/ n - Similarly E(Y'A 2 Y | ft) = tr(A 2 E(ft) + [/x(ft)]'A 2 /x(ft) = 
{n—k)n 2 since [/z(ft)]'A 2 /z(ft) = ft J2i=i Sj=i(Mi — Mi) 2 = 0- From these developments wc 
infer that the distribution of the ratio F = (n— k)Y' ' A{Y '/(k - 1)Y' A 2 Y satisfies C(F | ft) 
= F(k- l,n- fc,A(ft)) with A(ft) - ft EtiM^ ~ tfltiu? - EjUnifo - ^)> 2 . 
Thus the conditional and unconditional distributions are identical, i.e. £(F|ft) = £(F) — 
F(k -l,n- k, A), with A = E t fe =i™iO^ - t 1 ) 2 /^ 2 . □ 

5. Diagnostics 



5.1. Objectives. Calibration errors exact profound disturbances, both in models and in 
data-analytic procedures, as shown. Myriad calibrated data sets have been analyzed to 
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date, supported of late by an evolving battery of diagnostic tools. On these grounds it 
is tempting to dismiss the present study as academic: For surely these issues long since 
would have surfaced in practice, to be addressed accordingly. At issue is the capacity of 
known diagnostics to uncover calibration-induced irregularities as documented here. We 
now address these concerns with regard to induced correlations, nonnormality, mixture 
distributions having excessive tails, and possible outliers. For definiteness we return to the 
case of a single sample as in Section 4.1. 

^2 

5.2. Correlation. Neither the conditional {o\/{/3 x g 2 z + er§)) nor the unconditional ((er§ + 
cr iA*|)/( K 20'| + <7q + (t\ijl 2 z )) correlations need be negligible. Tests for correlation entail 
dispersion matrices V(Y) = t 2 S(uj), for which S(u>) = (J„ + loA) with A fixed and 
S(w) £ §+. Specializing gives t 2 S(uj) as £(/?) under the equicorrelation models encoun- 
tered here. Tests of note are due to Durbin and Watson (1950, 1951, 1971), Anderson and 
Anderson (1950), Theil (1965), and others, all based on versions of von Neumann's (1941) 
ratio U = R' BR/ R' R, with R as the observed residuals and with B(n x n) fixed. For 
further details see Kariya (1977). However, here the unconditional distributions are all iden- 
tical to their normal-theory forms as if C(R) = N n (0, a 2 B n ). This is seen from the proof 
for Theorem 3(ii), where C(R \ (3^ — N n (0, f3 x o 2 z B n ), together with the scale-invariance 
of U = R'BR/R'R, assuring that £(R BR/ R = C(R'BR/R'R) unconditionally. 
All such diagnostics for correlative dependencies are totally blind, both to the conditional 
\V{Y\P X ) = /Vl-Tn + vllnl' n } and unconditional [V(Y) = n 2 a 2 z I n + (erg + alfi 2 z )l n l' n ] 
dispersion structures. In short, demonstrated calibration-induced correlations cannot be 
discerned through conventional diagnostic tools. 

5.3. Nonnormality. Diagnostics for normality encompass both graphical and hypothesis 
testing procedures. Graphics include plots of ordered residuals against their normal-theory 
expectations. Common usage includes the scaled residuals {Ri/Sy',1 < i < n}, or the 
Studentized residuals {WiRi/Sy',i = 1,2, ... ,n}, standardized so that Var{WiRi) — o\. 
See Sections 2.12 and 5.7 of Myers (1990), for example. In calibrated data these residual 
plots are indistinguishable from those for the conventional Gaussian model N n (/jl n ,a 2 I n ), 
whatever be the joint mixture density at (2.1) for the calibrated measurements. This follows 
since C(R/(RR)2\f3 1 ) = C(R/{R'R)z) from scale invariancc, the latter as a scaled singular 
multivariate Student's t-distribution having v = n — 1 degrees of freedom, depending on 
neither (3 1 nor a\. 

Tests for normality include the regression tests of Shapiro and Wilk (1965), known to 
be powerful against a wide range of alternatives, to include skewed or distributions having 
short or very long tails, even in small samples. See Royston (1988), for example. These tests 
utilize statistics W = (£™ =1 WiY [t] ) 2 /(n - 1)S$, where {Y m < Y [2] < . . . < Y [n] } arc the 
ordered values of {Yi, . . . , Y n }, and {w\, . . . , w n } are fixed weights. Such tests would appear 
promising for detecting the nonstandard mixture distributions of calibrated measurements, 
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where 

w _ (ELi^m) 2 _ (3oELi^ + 3iELi^^]) 2 ,,,, 

(n-l)S Y (n-l)S* ' 1 J 

However, since E™=i w i — for these tests, together with the identity S Y — 0\S%, it fol- 
lows that W = (£" =1 u^[ 4 ])7(n - Then £(VK | 3i) = -C(VT) holds unconditionally 
from cancellation. In short, all such regression tests fail to distinguish between Gaussian 
distributions, and the Gaussian mixtures of type (2.1). With regard to further variations on 
regression tests, as in D'Agostino (1982), similar arguments show that none is able to dis- 
tinguish between Gaussian distributions and their mixtures from calibrated measurements. 
Given sample moments {m r — E™=i(^ — Y) r ;r — 2,3,4}, tests based on the moment 
ratios {&i = m^jm^, 6 2 = ni4 /m|} are useful against skewed alternatives or distributions 
having excessive or short tails (D'Agostino (1982)). It is readily shown that these ratios are 
precisely those obtainable from {Z\, . . . , Z n }, so that their null distributions are identical 
to those for which CiY) = N n {fi\ n , a 2 I n ), whatever be the joint mixture distribution as 
in (2.1). On the other hand, the foregoing tests do offer a clear check on normality of the 
distribution of {Zi, . . . , Z„}, on which the mixtures (2.1) are predicated. 

In short, conventional Gaussian diagnostics are bereft of any capacity to distinguish 
between Gaussian errors, and Gaussian mixtures of type (2.1). Thus radical calibration - 
induced departures from Gaussian models cannot be discerned through routine screening 
using any of these diagnostics. 

5.4. Outliers. Commonly used diagnostics for a shift in location or scale at observation 
Yi include the Studcntized residuals U — Ri/Syy/ (1 — and the R-Student deletion 

diagnostic Rt{ = R4/ S-i^J '(1 — 1/n), where S-i is the sample standard deviation found on 
deleting Y t from {Yi, . . . , Y n }. As mixture distributions may have heavy tails, and since 
conventional diagnostics for normality have failed, it is natural to ask whether outlier diag- 
nostics might be sensitive to observations from mixtures of type (2.1). If so, then evidence 
for apparent outliers in calibrated data instead might be attributable to the calibration 
process itself. However, these diagnostics are all scale-invariant functions of the observed 
residuals {Ri, . . . , R n } 7 so that they are indistinguishable from statistics derived from the 
standard Gaussian model N n (/j,l n , a 1 J„). In short, conventional outlier diagnostics cannot 
distinguish between Gaussian errors, and heavy-tailed mixtures as in (2.1), even if a shift 
in location or scale has occurred at observation Yi. 

Section 5 has reexamined whether conventional diagnostics can detect calibration-induced 
anomalies, to include correlations, nonnormality, distributions having excessive tails, and 
possible outliers. Even radical departures from conventional assumptions cannot be dis- 
cerned through routine screening using any of the aforementioned diagnostics. In summary, 
the present study cannot be dismissed as merely academic, as evidence for anomalies trace- 
able to calibration could not have surfaced in practice through a battery of diagnostic tools 
as it has evolved to date. 
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6. Case Studies 

We apply the results of Section 4.1 to a numerical data set under the assumptions of 
Theorem 3. Table 1 gives the percent of purity (X) and the octane number (U) from a 
sample of n = 11 different gasoline production runs. Percent purity is determined readily, 
whereas octane numbers require expensive and time-consuming dynamic laboratory tests; 
hence the need for calibration. 
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Table 1. Percent of purity (X) and octane number (U) of gasoline 



X 


99.8 


99.7 


99.6 


99.5 


99.4 


99.3 


99.2 


99.1 


99.0 


98.9 


98.8 


u 


88.6 


86.4 


87.2 


88.4 


87.2 


86.8 


86.1 


87.3 


86.4 


86.6 


87.1 



The least-squares fit for U = /3 +l3 1 (X-X)+e has {n = 11, % = 87.2818, a = 0.1846, (3 1 = 
1.8546, (Xi = 0.5837}. Suppose that subsequent determinations of percent purity satisfy 
{C(Zi) = Ni(0, 1); 1 < i < n}, so that calibrated measurements are recovered as {Yi = 
/^o + /^i^; 1 — * — n } m units of octane number. Then the distribution of Y is the mixture 
of a normal distribution Ni(fj,(t), S(t)), with /x(t) = O + tfi z and E(t) — t 2 a%/n + a\, 
having the density gx(u;fj,(t), S(t)), with mixing distribution Ni(j3 1 ,a 2 ) having the density 
dG\(t). For convenience, we write this as C{Y) = Ni({3 + tfi z ,t 2 a' z /n + cr§)A t A^ 1 (/? 1 , a\), 
where A t designates the mixing operation. Accordingly, the density of Y is 



h{u) 



9i{u;l3 +tfj, z ,t a z /n + a^)dGi{t) 



exp 



[u-(p +t^ z )] 2 



exp 



2cr? 



^2n{t 2 a 2 z /n + al) ^2t 



dt 



(6.1) 



a function of the parameters £1 — {n, (3 Q , gq, fi z , az, @x> with skewness 0.1464x 10 -6 and 
kurtosis 3.855, and with conditional mean E(Y) — 87.2818, given the empirical calibration. 

Using equation (|6.ip . we compute the 95% probability region for Y as (86.037,88.526) 
compared to /3 ± 1. 96 j /3 \\az/vll = (86.184, 88.376) if Y were normal. This latter interval 
is actually a 92.2% probability region. In addition, the density fi(u) is bell-shaped but is 
not normal. The Table 2 gives its moments (mean, variance), and moment ratios (skewness 
(7), kurtosis («)) for selected values of the parameters fl. 

The scaled sample variance is a mixture of a gamma distribution, Gq(-, •), with mixing 
distribution dG2{w) as a non-central chi-squared distribution, to give £((n— l)Sy / 'g\p z ) = 
G ((n - l)/2, 2t)AtX?(A = Pl/crf) with E{S^/a%u\) = (1 + A) which for the Octane 
Data is the conditional value E{S\)la z a\ = 11.095, so that E(S%) = 3.780. The mixture 
distribution has density 



fo(u) 



u 



v/2-1 



: 2^ 2 r(i//2) 



w-"l 2 e- u l 2w dG 2 {w) 



w -v/2 e -u/2* 



-A/2-U/2 
""2V2 



j^A 1 j\T(l/2 + j 



-}dw (6.2) 



with v = (n — 1). 

If Y were normal with /3 1 a constant, then a 95% probability region for Sy could be 
found from P[x 2 (10; 0.025) < (n - l)S Y $\a% < x 2 (10; 0.975)] or equivalently P[1.1167 < 
Sy < 7.0449] which actually is a 74% probability region when variation in (3 1 is taken into 
account. The correct probability region is found by numerically integrating Equation (|6.2j) 
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to get P[10.8 < (n - \)S\ja\a\ < 336.5] = 0.95 = P[0.3680 < S£ < 11.46]. Wc find that 
using the first 20 terms in the infinite sum is adequate. 

Table 2. The moments (mean, variance), and moment ratios (skewness (7), kurtosis («;)) 
of C(Y) for selected values of the parameters = {n, O , <tq, h z , oz, /3 l5 o\\ 



n 


Pn 

1 U 




1x7 


a z 


#1 

" 1 






E(Y) 


Var(Y) 




K 


10 


1 


1 


1 


1 


1 


1 




2.0000 


2.2000 


0.1839 


3.2851 


20 


1 


1 


1 


1 


1 


1 




2.0000 


2.1000 


0.0986 


3.1463 


























20 


.5 


1 


1 


1 


1 


1 




1.5000 


2.1000 


0.0986 


3.1463 


20 


2 


1 


1 


1 


1 


1 




3.0000 


2.1000 


0.0986 


3.1463 


























20 


1 


.5 


1 


1 


1 


1 




2.0000 


1.3500 


0.1913 


3.3539 


20 


1 


2 


1 


1 


1 


1 




2.0000 


5.100 


0.0260 


3.0248 


























20 


1 


1 


.5 


1 


1 


1 




1.5000 


1.3500 


0.0956 


3.1070 


20 


1 


1 


2 


1 


1 


1 




3.0000 


5.1000 


0.0521 


3.0940 


























20 


1 


1 


1 


.5 


1 


1 




2.0000 


2.0250 


0.0260 


3.0373 


20 


1 


1 


1 


2 


1 


1 




2.0000 


2.4000 


0.3327 


3.5417 


























20 


1 


1 


1 


1 


.5 


1 




1.5000 


2.0625 


0.0506 


3.1463 


20 


1 


1 


1 


1 


2 


1 




3.0000 


2.2500 


0.1778 


3.1452 


























20 


1 


1 


1 


1 


1 


.5 




2.0000 


1.3125 


0.0499 


3.0267 


20 


1 


1 


1 


1 


1 


2 




2.0000 


5.2500 


0.0998 


3.3614 


























10 


1 


.5 


1 


2 


1 


2 




2.0000 


6.2500 


0.6144 


5.5559 



For the density of = n(Y - ^) 2 /S^, set 

{v = n-l,5 = 0*y- &?/ctW z , 50,) = 5/0\/a\)}. 

Its density is found on mixing the non-central t 2 {v,5/t) over a non-central chi-squared as 
the mixing distribution, which we write as £(tg) = t 2 (v, 5/t)k t x\ (A = (3\/a\). The density 
is given by 

g(u;v,S,X) = 

fig ^zg)±i qzv ^f± dt . ( 6 .3) 

For the first sum, we use the first Ni — 15 terms, and for the second sum the first N2 = 30 
terms. 
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This distribution is useful for computing the power of the t 1 test. For example, with 
n = 11 so v = 10, the 95% critical value is 4.9646 with 6 = 0. Table 3 gives the power of 
the test for 6 = {0, 1,4,9} and A = {1,4,9}. 

Table 3. Power for the test Ho : \i Y = fi Y against Hi : fi Y My 
for 6 e {0, 1, 4, 9} and A e {1, 4, 9} 



A = 


1 


4 


9 


6 = 


.950 


950 


950 


6 = 1 


.691 


.863 


.928 


6 = 4 


.485 


.742 


.876 


6 = 9 


.329 


.608 


.799 



For the Octane Data, the power of the test of Ho : \x Y = (i Y , with (n Y — n Y ) 2 = 1, has v = 
10, 5= fay- [i Y f I o\g\ = [(1)(0.5837)]- 2 = 2.9351, and A = @\/a\= (1.8546/0.5837) 2 = 
10.0953. The power of the test is 90%. 

An equivalent form for is based on to as C(t a ) = t(v, 6o/s)h. s ^J xf(A = 0\/a\), 

mixing over a shifted half-normal distribution. Its density is 

ft (u) = 

r e~^r((, + l)/2)(^)^ » r((^ + j + l)/2) ( V2u6q/ S V r- 

Jo v^m u wwrm {7^) dVG2{s) (6 - 4) 

having non-centrality parameter <5 = \/5 and 

e -(s-Ao) 2 /2 +e -(-s-X a f/2 

rf\/G 2 (s) = == ds 

V2t7 

with non-centrality parameter A = V\. This series has faster convergence and we used 
N = 20 terms in the forgoing power calculations with noncentrality parameters 6 = V6 
and A = V\. Computations reported here were executed by the second author using the 
Alaple software package. 

7. Conclusions 

In summary, the widespread and necessary use of calibration may have devastating effects, 
even on elementary data-analytic procedures pertaining to location and scale parameters. It 
is unfortunate that these difficulties cannot be flagged by the ever expanding use of available 
diagnostic tools. It thus is incumbent on knowledgeable users of statistical methodology, 
and the statistical consultants advising them, to assess the extent of these difficulties as they 
might impact the analysis and interpretation of data in a particular experimental setting. 
Let the user be forewarned. Fortunately, comparisons among means and among variances, 
in the context of comparative one-way experiments, are largely unaffected by the use of 
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calibrated instruments when subject to errors of calibration, provided that the results are 
interpreted accordingly. 
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8. APPENDIX 

It is germane to examine the comparative concentration of probability measures on R n . 
Following Sherman (1955), the measure fi(-) is said to be more peaked about Oel™ than 
v(-) if and only if ^(A) > v{A) for every set A in the class C(n) comprising the convex sets 
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in R™ symmetric under reflection through € R™. For scale mixtures of Gaussian measures 
on R™, their peakedness ordering is tantamount to the stochastic ordering of their mixing 
distributions. Details follows. 

Lemma 2. Let GM n (9,S,G\) and GM n (9,S, G 2 ) be Gaussian mixtures on R™ of type 
(2.2) having mixing distributions Gi(-) and G 2 (-) on R+. Then GM n (9,a,Gi) is more 
peaked about 9 e R" than GM n {8, H, G 2 ) if and only if Gi(t) < G 2 (t) for every t > 0. 

Proof. The ordering G\{t) < G2(t), i.e., that Gi(-) is stochastically larger than G 2 (-), holds 
if and only if there are increasing functions {V'iG); V^O)}' ordered pointwise as ipi(t) > 
tp 2 (t), together with a random variable U, such that G\(t) = P(i/» 1 (i7) < t) and G2(t) = 
P(tp 2 (U) < t); see Lemma 1, page 84 of Lehmann (1986), for example. Accordingly, we 
provisionally write n(A) = J A f(x;9,S,Gi)dx and v{A) = f A f(x;0,3,G 2 )dx, and their 
difference as 



Given that G\{t) < G 2 (t), so that ViW > ^2 (*)^ the ordering J A [g(x; /x, H/^ 1 (t)) — 
^(x; /x, E/-0 2 (t))]cfe > follows point-wise for each fixed t e R+ from Corollary 3 of Ander- 
son (1955), since S/i/j 2 (t) >l ^/ipi (t)) uniformly in t. That [n(A) — v(A)] > now follows 
directly. Conversely, suppose that fJ-(A) > v(A). We now apply the converse to Anderson's 
(1955) Corollary 3, as proved in Jensen (1984), to infer that Vi(*) > "02 W f° r eacn * > 0, 
thus establishing the necessity of the condition G\(t) < G 2 (t), to complete our proof. □ 
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[g(x; 9, EM(i)) - g(x; 9, S/^ 2 (t))]dxdG(t). 



